Large-scale identification and characterization of alternative splicing variants of human gene transcripts using 56 419 completely sequenced and manually annotated full-length cDNAs

Authors

  • Jun-ichi Takeda
  • Yutaka Suzuki
  • Mitsuteru Nakao
  • Roberto A. Barrero
  • Kanako O. Koyanagi
  • Lihua Jin
  • Chie Motono
  • Hiroko Hata
  • Takao Isogai
  • Keiichi Nagai
  • Tetsuji Otsuki
  • Vladimir Kuryshev
  • Masafumi Shionyu
  • Kei Yura
  • Mitiko Go
  • Jean Thierry-Mieg
  • Danielle Thierry-Mieg
  • Stefan Wiemann
  • Nobuo Nomura
  • Sumio Sugano
  • Takashi Gojobori
  • Tadashi Imanishi

Abstract

We report the first genome-wide identification and characterization of alternative splicing in human gene transcripts based on analysis of full-length cDNAs. Applying both manual and computational analyses to 56 419 completely sequenced and precisely annotated full-length cDNAs selected for the H-Invitational human transcriptome annotation meetings, we identified 6877 alternative splicing genes with 18 297 different alternative splicing variants. A total of 37 670 exons were involved in these alternative splicing events. The encoded protein sequences were affected in 6005 of the 6877 genes. Notably, alternative splicing affected protein motifs in 3015 genes, subcellular localizations in 2982 genes and transmembrane domains in 1348 genes. We also identified interesting patterns of alternative splicing, in which two distinct genes seemed to be bridged, nested or to have overlapping protein coding sequences (CDSs) in different reading frames (multiple CDS). In these cases, completely unrelated proteins are encoded by a single locus. Genome-wide annotations of alternative splicing, relying on full-length cDNAs, should lay firm groundwork for exploring in detail the diversification of protein function, which is mediated by the fast-expanding universe of alternative splicing variants.

Similar resources

H-DBAS: Alternative splicing database of completely sequenced and manually annotated full-length cDNAs based on H-Invitational

The Human-transcriptome DataBase for Alternative Splicing (H-DBAS) is a specialized database of alternatively spliced human transcripts. In this database, each of the alternative splicing (AS) variants corresponds to a completely sequenced and carefully annotated human full-length cDNA, one of those collected for the H-Invitational human-transcriptome annotation meeting. H-DBAS contains 38,664 ...

Low conservation and species-specific evolution of alternative splicing in humans and mice: comparative genomics analysis using well-annotated full-length cDNAs

Using full-length cDNA sequences, we compared alternative splicing (AS) in humans and mice. The alignment of the human and mouse genomes showed that 86% of 199 426 total exons in human AS variants were conserved in the mouse genome. Of the 20 392 total human AS variants, however, only 59% consisted entirely of conserved exons. Comparing AS patterns between human and mouse transcripts revealed that only 4...

Identification and Functional Analyses of 11 769 Full-length Human cDNAs Focused on Alternative Splicing

We analyzed diversity of mRNA produced as a result of alternative splicing in order to evaluate gene function. First, we predicted the number of human genes transcribed into protein-coding mRNAs by using the sequence information of full-length cDNAs and 5'-ESTs and obtained 23 241 of such human genes. Next, using these genes, we analyzed the mRNA diversity and consequently sequenced and identif...

Expression Pattern of Alternative Splicing Variants of Human Telomerase Reverse Transcriptase (hTERT) in Cancer Cell Lines Was not Associated with the Origin of the Cells

Telomerase and the systems controlling its activity have received great attention. There are controversies regarding the role of the alternative splicing forms of human telomerase reverse transcriptase (hTERT), the catalytic subunit of telomerase. Therefore, the correlation between telomerase enzyme activity, the abundance of alternatively spliced variants of hTERT and doubling time of a seri...

The New Phase of Transcriptome Analysis

We have established a large-scale system named CAGE (CAP-based analysis of gene expression) for identifying 5' transcription start sites (TSSs) and promoter regions. With this system we have obtained over 10,000,000 CAGE tags from human and mouse. We have also determined the sequences of more than 100,000 full-length cDNAs from mouse, which were subsequently used to study the transcriptiona...


Journal title:
  • Nucleic Acids Research

Volume 34

Publication date: 2006